NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Towards high quality mobile crowdsensing: Incentive mechanism design based on fine-grained ability reputation

https://doi.org/10.1016/j.comcom.2021.09.026

Luo, Zhuangye; Xu, Jia; Zhao, Pengcheng; Yang, Dejun; Xu, Lijie; Luo, Jian (December 2021, Computer Communications)

Full Text Available
Biobjective Robust Incentive Mechanism Design for Mobile Crowdsensing

https://doi.org/10.1109/JIOT.2021.3072953

Xu, Jia; Zhou, Yuanhang; Ding, Yuqing; Yang, Dejun; Xu, Lijie (October 2021, IEEE Internet of Things Journal)

Full Text Available
Incentive Mechanism Design for Truth Discovery in Crowdsourcing with Copiers

https://doi.org/10.1109/TSC.2021.3075741

Jiang, Lingyun; Niu, Xiaofu; Xu, Jia; Yang, Dejun; Xu, Lijie (April 2021, IEEE Transactions on Services Computing)

Full Text Available
Sync-Switch: Hybrid Parameter Synchronization for Distributed Deep Learning

Li, Shijian; Mangoubi, Oren; Xu, Lijie; Guo, Tian (January 2021, 41th IEEE International Conference on Distributed Computing Systems (ICDCS'21))
null (Ed.)
Full Text Available
DistStream: An Order-Aware Distributed Framework for Online-Offline Stream Clustering Algorithms

Xu, Lijie; Ye, Xingtong; Kang, Kai; Guo, Tian; Dou, Wensheng; Wang, Wei; Wei, Jun (January 2020, 40th IEEE International Conference on Distributed Computing Systems (ICDCS'20))

Stream clustering is an important data mining technique to capture the evolving patterns in real-time data streams. Today’s data streams, e.g., IoT events and Web clicks, are usually high-speed and contain dynamically-changing patterns. Existing stream clustering algorithms usually follow an online-offline paradigm with a one-record-at-a-time update model, which was designed for running in a single machine. These stream clustering algorithms, with this sequential update model, cannot be efficiently parallelized and fail to deliver the required high throughput for stream clustering. In this paper, we present DistStream, a distributed framework that can effectively scale out online-offline stream clustering algorithms. To parallelize these algorithms for high throughput, we develop a mini-batch update model with efficient parallelization approaches. To maintain high clustering quality, DistStream’s mini-batch update model preserves the update order in all the computation steps during parallel execution, which can reflect the recent changes for dynamically-changing streaming data. We implement DistStream atop Spark Streaming, as well as four representative stream clustering algorithms based on DistStream. Our evaluation on three real-world datasets shows that DistStream-based stream clustering algorithms can achieve sublinear throughput gain and comparable (99%) clustering quality with their single-machine counterparts.
more » « less
Full Text Available
Incentivizing the Workers for Truth Discovery in Crowdsourcing with Copiers

Jiang, Lingyun; Niu, Xiaofu; Xu, Jia; Yang, Dejun; Xu, Lijie (July 2019, IEEE International Conference on Distributed Computing Systems (ICDCS))

Full Text Available
Incentive Mechanism for Multiple Cooperative Tasks with Compatible Users in Mobile Crowd Sensing via Online Communities

https://doi.org/10.1109/TMC.2019.2911512

Xu, Jia; Rao, Zhengqiang; Xu, Lijie; Yang, Dejun; Li, Tao (April 2019, IEEE Transactions on Mobile Computing)

Full Text Available
Speeding up Deep Learning with Transient Servers

Li, Shijian; Walls, Robert J.; Xu, Lijie; Guo, Tian (January 2019, 2019 IEEE International Conference on Autonomic Computing (ICAC))

Distributed training frameworks, like TensorFlow, have been proposed as a means to reduce the training time of deep learning models by using a cluster of GPU servers. While such speedups are often desirable—e.g., for rapidly evaluating new model designs—they often come with significantly higher monetary costs due to sublinear scalability. In this paper, we investigate the feasibility of using training clusters composed of cheaper transient GPU servers to get the benefits of distributed training without the high costs. We conduct the first large-scale empirical analysis, launching more than a thousand GPU servers of various capacities, aimed at understanding the characteristics of transient GPU servers and their impact on distributed training performance. Our study demonstrates the potential of transient servers with a speedup of 7.7X with more than 62.9% monetary savings for some cluster configurations. We also identify a number of important challenges and opportunities for redesigning distributed training frameworks to be transient-aware. For example, the dynamic cost and availability characteristics of transient servers suggest the need for frameworks to dynamically change cluster configurations to best take advantage of current conditions.
more » « less
Full Text Available
An Experimental Evaluation of Garbage Collectors on Big Data Applications

Xu, Lijie; Guo, Tian; Dou, Wensheng; Wang, Wei; Wei, Jun (January 2019, The 45th International Conference on Very Large Data Bases (VLDB'19))

Popular big data frameworks, ranging from Hadoop MapReduce to Spark, rely on garbage-collected languages, such as Java and Scala. Big data applications are especially sensitive to the effectiveness of garbage collection (i.e., GC), because they usually process a large volume of data objects that lead to heavy GC overhead. Lacking in-depth understanding of GC performance has impeded performance improvement in big data applications. In this paper, we conduct the first comprehensive evaluation on three popular garbage collectors, i.e., Parallel, CMS, and G1, using four representative Spark applications. By thoroughly investigating the correlation between these big data applications’ memory usage patterns and the collectors’ GC patterns, we obtain many findings about GC inefficiencies. We further propose empirical guidelines for application developers, and insightful optimization strategies for designing big-data-friendly garbage collectors.
more » « less
Full Text Available
Online Incentive Mechanism for Mobile Crowdsourcing Based on Two-Tiered Social Crowdsourcing Architecture

https://doi.org/10.1109/SAHCN.2018.8397102

Xu, Jia; Guan, Chengcheng; Wu, Haobo; Yang, Dejun; Xu, Lijie; Li, Tao (June 2018, IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks (SECON))

Full Text Available

Search for: All records